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ABSTRACT 

A computed line list for formaldehyde, H2^^C^®0, applicable to temperatures up to 
T = 1500 K is presented. An empirical potential energy and ab initio dipole moment 
surfaces are used as the input to nuclear motion program TROVE. The resulting 
line list, referred to as AYTY, contains 10.3 million rotational-vibrational states and 
around 10 billion transition frequencies. Each transition includes associated Einstein- 
A coefficients and absolute transition intensities, for wavenumbers below 10 000 cm“^ 
and rotational excitations up to J = 70. Room-temperature spectra are compared 
with laboratory measurements and data currently available in the HITRAN database. 
These spectra show excellent agreement with experimental spectra and highlight the 
gaps and limitations of the HITRAN data. The full line list is available from the CDS 
database as well as at www. exomol. com. 


1 INTRODUCTION 


Formaldehyde, H 2 CO, is a poisonous molecule in the aldehyde group. On Earth it plays a part in troposphere chemistry 
dynamics as the main source of OH via photo-diss ociation and is formed from photo-oxidation in the atmosphere or through 
the incomplete burning of biomass (Wayne 2000l l . Traces of formaldehyde have tentatively been detected in the martian 


atmo sphere ( Korablev et al. 1993h where it is believed to be derived from the oxidation of methene (C 2 H 4 ) 1 Villanueva et al, 

2OI3II . 


Formaldehyde was the f irst polyatom ic molecule to be detected in the interseller medium (ISM) I Zuckerman et al. 19701 1 
and is e xtremely abundan t Jh anger 197^ 1. This has made it useful in investigating the isotope composition of carbon in the 

S y (IZuckerman et al.lll974l l. The proposed mechanism of production is via the successive hydrogenation of CO llWoon 
on icy grain mantles: 


H + CO 
H + HCO 


HCO 

H 2 CO. 


( 1 ) 


Further hydrogenation produces methanol through an intermediate meth yl radical H+H 2 CO—CH 3 O—>■ H-|- CH 3 O ^ CH 3 OH. 
Common reactions include that with ammonia which produces amines I Schuttel 2 OO 2 II and polymerisation with other H 2 CO 
molecules. As a result, formaldehyde is believed to be the major pr ecursor for the for mation of com plex organic molecules in 
the ISM that include interstellar glycolaldehyde J Hollis et al. 200d 'l and amino acids (Schutte 2 OO 2 II . 


201c 


F ormaldehyde’s astrophysica l relevance does not end i n the ISM. Re cently, it has been detec ted in comets llBockele e-Morv an fc Crovisier 
1992), such as 103P/Hartley 2 ( Dello Russo et al IB), c/2007 N3 llVilIanueva et al.ll201 ll'l and Hale-Bopp dMilar^’eTlin 
200611 . where it is thought to originate from the degredation of polyoxymethylene (Cottin_^^^ 2001). It is also present 


Zasowski et al 


in protoplanetary d iscs around low mass young stars (Taurus-Auruga Class I/H) (lOberg et al 
Sargent et al.ll2014r i as circumstellar ice with an abundance ratio of « 2% compared to the more ubiquitous water-ice. 


2OO9I : 


Because of H 2 CO’s role as a precursor to complex organic molecules, it is consid ered a possible bio marker. The RNA 
world hypothesis suggests an early Earth with a CO 2 , H 2 O and N 2 rich atmosphere I Neveu et al. 20131 ). Illuminating this 
mix with ultraviolet (UV) radiation should lead to a large a mount of formaldehyde being fixed in the atmosphere before 
being deposited into the prebiotic oceans ( Neveu et al. 20131 ). Alternatively, the source of prebiotic chemical comp ounds 
may be derived without need of illuminating UV radiation via glancing icy body impacts ( Goldman fc Tamblvnll2013l) . Such 
impacts would produce shock-compression conditions that lead to formation of HCN molecules. These HCN molecules can be 
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2 Al-Refaie et al 


hydrolyzed to form formaldehyde and from there produce amino acids. Thus a planet rich in formaldehyde may indicate one 

undergoing the stages of pre-life. _ 

Finally, formaldehyde masers 1 Forster et al. 1980l : Pratap et al. 1992ll are a reliable and proven tracer for high-density 
environments such as star-forming regions in galaxies due to its ubi quity and large numb er of long wavelength transitions 
1 Mangum et al. 2008ll . Currently, there are 19 extragalactic sources llVIangum et al. 200d)_of these masers including IRAS 
18566 + 0408, which is notable for detection of the first H 2 CO maser flare ( Arw^t al.l 12007 1. Formaldehyde masers (and 
maser flares) have mostly been observed via the lio —>■ In and 2ii — 2 i 2 A-doublet transitions at 6.1 cm and 2.2 cm 
respectively. 

The wide-range of interactions in atmospheric, terrestrial, astrophysical and astrobiological phenomena makes formalde¬ 
hyde a relevant molecule in the chemistry of exoplanets and their atmospheres. Therefore a complete, high-resolution, line list 
for H 2 CO should provide an important aid for charact erisation and modelling of for maldehyde. These considerations led us 
to study formaldehyde as part of the ExoMol project I Tennyson &: Yurchenkol 2012l 'l. which aims to produce comprehensive 
molecular line lists for studies of the atmospheres of exoplanets and cool stars. 

High-resolution, room-temperature formaldehyde spectra hav e been well-studied in the laboratory 


1975; 


Brown et al 


1996; Theule et al 


1971 ; Nakanaga et al 


198i 


Nadler et al 


1987; 


Cline_^^^xghe^ 19881 : Reuter et al. 


Johns fc McKellar 

19891: 


Poulin et al. 


2003 ); the early work was reviewed bvIClouthier fc Ramsav ( lj83 ). Currently, the major source of publicly 


available spectroscopic data on H 2 CO is the HITRAN database (Rothman et al. 2013lf which has recently been updated to 
include long-wavelength data fr om the CDMS data base ( Muller et al.ll2005[ ). T he'^ectral regions covered in the database are 0 
- 100 cm“^, 1600 - 1800 cm“^ ( Perrin et al. 2009lf and the 2500 - 3100 cm“^ ( Perrin et al.ll200^ at up to 10~^® cm/molecule 
sensitivity for r=296 K. However, this accounts for only 40 000 transitions extending up to J = 64 and covers only four of the 
six fundamental vibrational bands as well as the ground state rotational spectrum. This deficiency arises from an apparent 


lack of absolute intensities in the 100 - 1600 cm ^ range. Additional observed tr ansitions are available_|Perrin_et_^ 


and include line p ositions ( Perrin et al. 20031 : Tchana et al. 2007 : Zhao et al] 2007|l. and intensities |P errin et al 


200 ! 


2003 


2001 


Flaud et al.ll2006l ) of some of the fundamental bands and hot bands (llto et al.ll 19941 : iPerez et al.ll2006l : iMargules et al.ll2009l f. 


The incompleteness and low rotational excitations available in HITRAN limits the applicability of this data for temperatures 
above 300 K. The theoretical spectra presented in this paper aims to provide a more complete and accurate picture of the 
spectra of formaldehyde up to 10 000 cm“^ and for temperatures up to 1500 K. Our line list should therefore be useful for 
modelling higher temperature environments as well as studies on non-LTE transiti ons such as those obser ved in masers. 

( 1996lf 


Theoretically, electric dip ole transition int e nsities of H 2 CO were studied by iLuckhaus et al 


and Carter et al. 


( 2 OO 9 II : see also the review by Yurchen^ (2014). Luckhaus et al. ( 1996l l used an ab initio MP2/6-311G** DMS to simulate 
the photoacoustic spectrum of high C-H stretching overtones of H 2 CO. Carter et al. ( 2009l ~) generated an ab initio couple- 
cluster CCSD(T)/aug-cc-pVTZ dipole moment surface (DMS) for H 2 CO; they used an eff ective charges represe ntation to 
compute (relative) rovibrational l ine in tensities for H 2 CO reproducing the HITRAN data ( Rothman et al. 20091 ') with rea¬ 
sonable agreement. Poulin et al. ( 19961 ) computed an ab initio DMS using the QCISD/6-31111G(d,p) level of theory and 
presented it as an expansion. 

Despite these works there is no comprehensive line list for formaldehyde availabl e in the literature. The goal of this work 
is to bridge this gap. We use the variational program TROVE (|Yurchenko_eLaL 2007 ') in conjunction with an initial potential 
energy surface (PES) obtained ‘spectroscopically’ by Yachmenev et al. ( 201lf) and a new ab initio dipole moment surface 
(DMS) for formaldehyde and generate an extensive line list for H2^^C^®0 applicable for the temperatures up to T = 1500 K. 
In the following H 2 CO and formaldehyde will refer to the main isotopologue H2^^C^®0. 


2 METHOD 
2.1 Background 


H 2 CO is a prolate asymmetric top molecule that belongs to the C 21 ; molecular symmetry group ( Bunker fc Jensen 1998l 'l. 
The group has four irreducible representations Ai, A 2 , B\ and H 2 . Once the H atom nuclear spin is taken into account the 
‘para’ A representations are singly degenerate and the ’ortho’ B representations are triply degenerate. As H 2 CO has four 
atoms, it has six vibrational modes; Table [T] shows the vibrational modes and their corresponding symmetries, band centers 
and descriptions. Coriolis in teractions occur strongly between the and vq modes, and weakly between the vs and Vi modes 
( Nakagawa fc Morinol 1971 ') which couples their energy levels and wavefunctions. This manifests itself in the 1 / 3 , 124 , and us 
mode interaction as overlapping bands which make these three bands difficult to distinguish from each other. 


2.2 Potential energy surface 


Full details of their PES calculation are given by lYachmenev et al.l (120111 ). so only a brief summary is presented here. The 


initial PES was computed ab initio using the CCSD(T)/aug-cc-pVQZ theory. Variational calculations with this surface give a 
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ExoMol line lists VIII: H 2 CO 3 


Table 1. Vibrational modes and observed band centres in cm ^ bv ICarter et alJ lll995ll . 


Mode 

Band Centers 

Symmetry 

Description 


2782.46 

Al 

symmetric C-H stretching 


1746.01 

Al 

C-0 stretching 

na 

1500.18 

Al 

symmetric O-C-H bending 

z /4 

1167.26 

Bi 

out-of-plane bending 


2843.33 

B 2 

asymmetric C-H stretching 


1249.10 

B 2 

asymmetric O-C-H bending 




Figure 1. Overview of the 1 ^ 3 , 1^4 and uq region between different PES used in production and PNNL-IR ijSharpe et al.|[20Q4^ cross- 
sections: (a) represents the over-refined PES (b) the ’better’ refinement. Note the difference in y-axis scaling in (a) to highlight structural 
features. 


root-mean-square (rms) error of 5.1 cm“^ for the fundamental band centers. lYachmenev et al.l ( 201 il l refined the ab initio PES 
using aV' — V + AV formnlation where V, AV and V' are the original ab initio, correction and refined PES respectively. The 
eigenfunctions of the original ab initio Hamiltonian, H = T + V, are used as the basis functions for the extended Hamiltonian 
H' = H + AV where AV is typically small and almost diagonal correction. AV is expanded in Taylor series and expansion 
coefficients are obtained in a variational least-squares fit to a high-resolution spectroscopic data using TROVE. The rms error 
against experimental energy levels with J ^ 5 of this semi-empirical PES, called H2CO-2011, is 0.04 cm“^. 

It should be noted however that the excellent accuracy of the refined PES caused serious problems with the absolute 
intensities of some bands. The intensities based on the initial, less accurate ab initio PES, the intensities of the u^, 1^4 and ve 
bands were observed to agree much better with the experiment. Figure[T][a) highlights this effect, where an order of magnitude 
difference in absolute intensity and cross-section was observed. Initially blamed on the DMS, it was soon discovered that the 
original ah initio PES did not have this problem. 

In order to address this issue we have repeated the refinement process making it less aggressive with careful observation of 
the transition moments. In the present work we have also increased the size of the basis set (see discussion below). We used the 
same fitting set of experimental energies as well as the same functional form for PES. The rms deviation of the experimental 
term values used in the PES fit (J ^ 5) against the computed energies in AYTY is 0.18 cm“^ (0.006 cm“^ for pure rotational 
term values). The potential parameters as well as the associated Fortran 95 program are given as supplementary material. 
The resulting line intensities have returned to quality of the ah initio levels as can seen in Figure [T](b). 


2.3 Variational computation 


The TROVE program suite (I Yurchenko et al.l 120070 is employed to compute our formaldehyde line list as well to perform 
the least-squares fit of the ab initio PES discussed above. TROVE is designed to compute variational ro-vibrational energy 
levels, associated eigenfunctions and transition intensities for molecules of arbitrary structure. Variational methods are often 
limited in their efficiency due to the need to diagonalize increasingly large Hamiltonian matrices as the complexity of the 
molecule increases. However with the improved computational power and parallelism of modern CPUs, it is now feasible to 
solve ro-vibrational Schrodinger equations for polyatomic molecules. TROVE has been succ essfully used to produce high - 
accur acy line lists and spectr a for te tratomic molecules such as HS OH ( Yachmene^_et_ah 201oll. NH.q ( Yurchenko et al. 2011 1. 
SO 3 ( Underwood et al. 2013ll. PH 3 JSousa-Silva et al. 20131 . 2015h . and SbHa I Yurchenko et al. 201ol l. as well as recently the 
pentatomic molecule CH 4 ( Yurchenko fc TennvsorJ 2014 1. 

TROVE approximates the kinetic energy operator (KEO) by a truncated Taylor series expansion in generalized coordi- 
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4 Al-Refaie et al 


nates. Previous calculations for H 2 O 2 by iPolvanskv et al.l (120131 1 show that computations using series represen tation of the 


KEO for non-linear molecules, such as H 2 CO, converge as well as exact KEO-based programs such as WAVR4 lIKozin et al. 


200411 and, particularly for excited rotational states, at cost of significantly less computational time. In this work we use a 


kinetic expansion order of 6 for optimal convergence while providing reasonable computation times and similarly, we use a 
potential energy expansion order of 8. 

In TROVE, the primitive vibrational basis set is represented by a symmetrized product of six one-dimensional vibra¬ 
tional functions (j>n^{r{), <j>n2{i'2), fling(r|), 4 >ni{d{), (jinA dy)-, and 4 >nAT), where rtj denotes the associate d local mode vibra¬ 
tional quanta, {rf, r 2 , r|, , @ 2 } are linearized versions ( Yurchenko et al. 2007 : Bunker fc Jensen lOOSl l of the coordinates 



( 2 ) 


the ID basis functions is contracted using the polyad condition: 

P = 2(n2 -I- ns) -(- ni + n4 -I- ns -I- ne < Pmax , 
which in terms of the normal mode quantum numbers vt reads 

P = 2{vi + Us) +V 2 +V 3 +V 4 +V 6 ^ ^max- (3) 

This polyad rule is based on the approximate relationship between the H 2 CO fundamental frequencies (see Table [T]): 

m ~ z/s ~ 2^2 ~ 2n3 « 2^4 ~ 2^6. (4) 

The vibrational basis set is further optimized by solving four reduced eigen-problems variationally for {(?i}, { 92 , 93 }, 
{ 94 , 95 }, and {qe} to produce four sets of wavefunctions ( 92 , 93 ), ( 94 , 95 ), and 4?l®g((j6), respectively. At 

the step 2 the pure vibrational (J = 0) problem is solved variationally using the basis set constructed as a symmetrized 
direct product of $1*] (i = 1,6) and ij,k = 2, 3 or 4, 5 ) contracted through the polyad number condition ([2]) and 

symmetrized according to the C 2 i,(M) molecular symmetry group. In this work the basis set is truncated at Pmax = 16 as the 
relative simplicity of the molecule means that this gives well-converged results. The maximum polyad number Pmax restricts 


the number of combinations of and for which P ^ Pmax. The resulting eigenfunctions obtained for each 

C 2 «(M) symmetry T = A i, A 2 , Pi and B 2 togeth er with the symmetrized rigid rotor wavefunctions |J, A, Trot) form our J = 0 
basis set representation 1 Yurchenko et al. 2009li . where the ro-vib rational basis fun c tions a re given as a direct product of 
and | J, K, Trot). Here Trot is the rotational parity defined by Yurchenko et al. ( 2005a 1. and K is th e projection o f the 
angular momentum on the body-fixed axis 2 :. The latter is defined according with the Eckart conditions ( Eckart 1935ll and 
is oriented approximately along the CO bond. In C 2 i,(M) symmetry, K and Trot correlate with the customary Ka and 
rotational quantum numbers as 

K = Ka, r,ot = mod(|Aa - Ka\, 2). (5) 

The vibrational part of the J = 0 basis set is truncated using the energy threshold of he 18 000 em'^and thus consists of 2310, 
1531, 1688, and 2112 functions for the A\,A 2 ,Bi and P 2 symmetries, respectively. 

The resulting ro-vibrational Hamiltonian matrix in the J = Q representation exhibits a block diagonal structure where 
each of four blocks represents an irreducible representation Ai, A 2 ,Bi or B 2 and can be diagonalised independently. Each of 
these blocks displays a band-diagonal structure whose bandwidth and length is determined by the J = 0 basis set size and 
the level of rotational excitation respectively. 

In generating our line list we employed an upper eigenvalue limit of 18 000 cm“^ as the intensity of transitions involving 
higher energy states are too weak to be important. The J = 0 matrix blocks produced by TROVE were on average dimensions 
1920 X 1920. The rule of thumb for the average size of a block for J ^ 1 is 1 920(2 J -|- 1). The largest J computed was 
J = 70 which required the diagonalisation of mat rices in the order of ~ 300 000 for eigenval ues and eigenvectors. The linear 
algebra libraries LAPACK ( Anderson et al. 1999l 'l and SCALAPACK ( Blackford et al. 1997 1 were employed to solve for the 
eigenvalues and eigenvectors. 


s(hl) 


2.4 Dipole moment surface and intensities 

Intensity computation requires high quality electric DMS. We u se an ab initio DMS computed at the CCSD(T)/aug-cc-pVQZ 
level of theory in the frozen-core approximation using CFOUR ( Harding et al. 2008h . Three symmetry-adapted projections of 
the dipole moment Cartesian components, fiAi , fJ-Bi , and , are given in the analytical representations with each component 
expanded in Taylor seri es (185 parameters in tot al) in terms of internal coordinates around the equilibrium configuration using 
the form developed by Yachmenev et al. ( 2013l l to represent the dipole moment of H 2 CS. These parameters reproduce the 
ab initio dipole moment values of the /taj , /4bi , and /tbj components with rms errors of 0.0002 Debyes for each component. 
The equilibrium value of our dipole moment is 2.3778 D (at r^o = 1.2033742 A, = 1.10377 A, 0och = 121.844°), which 
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ExoMol line lists VIII: H 2 CO 5 


can be compared to t he experimental value of the ground vibrational state dipole moment of p=2.3321(5) D measured by 
Fabricant et al. ( 1977 1. 

The eigenvectors, obtained by diagonalization, are used in conjunction with the DMS to compute the required linestrengths 
(and from that the Einstein-A coefficients and absolute intensities) of transitions that satisfy the rotational selection rules 

f - J" = 0, ±1, f + J" / 0 (6) 

and the symmetry selection rules 

Al A 2 , ^ B2- (7) 

The Einstein-A coefficient for a particular transition from the initial state i to the final state / is given by: 


STr^vff 

^if — ——(2Ji + 1) 




( 8 ) 


where Ji is the rotational quantum number for the initial state, h is Planck’s constant, Vif is the transition frequency 
[hcuif = Ef — Ei), and 5'* represent the eigenfunctions of the final and initial states res pectively, JXa is the electr onically 


averaged component of the dipole moment along the space-fixed axis A — X,Y, Z (see also lYurchenko et alJ l|2005bh l. From 
this the absolute absorption intensity is determined by: 

’ —hcUif' 


ni ^ = ^^9nsi2Jf + 1 ) 


( kT ) 


Quf, 


1 — exp 


kT 


(9) 


where k is the Boltzmann constant, T the absolute temperature and gns is the nuclear spin statistical weight factor. Q, the 
partition function, is given by: 


g = E5iexp(-^ 


( 10 ) 


where gi is the degeneracy of a particular state i with energy Ei. For H 2 CO, gt is gns{2Ji + 1) with pns = 1 for Ai and A 2 
symmetries and pns = 3 for Bi and B 2 symmetries. The transitions were computed using the energy limits he 8 000 and he 
18 000 cm“^ for the lower and upper states, respectively. 

Although diagonalisation of the Hamiltonian matrices is very demanding on computer resources, it is the calculation of 
the Einstein-A coefficients which dominates the actual computer time due the sheer number of these and the large size of the 
eigenvectors. Graphics processing units (GPU) were therefore employed to accelerate computation of the intensities. To do 
this required the development of a new algorith m to allow these fast b ut memory poor processors to be used efficiently. A 
paper discussing this will be published elsewhere IIAl-Refaie et al.ll2015n . 


3 RESULTS 


The line list produced, which we call AYTY, contains around 10 billion transitions with wavenumbers up to 10 000 cm 


The transitions are sorted in increasing transition frequency and then converted into the ExoMol format llTennvson et al. 


2 OI 3 II . An extract of the state file and transition file can be seen in Tables [2] and [3] Spectra at arbitrary temperatures can 
be computed using the Einstein-A coefficients from the transition files. The theoretical error as estimated by the fitting rms 
deviation of 0.18 cm“^. This means our transition frequencies and energy levels should be reliable to about 0.2 cm—1 with 
low-lying levels, particularly the pure rotational ones, being much more accurate than this and levels for vibrational states 
for which there are no available laboratory data much less so. 

The completeness of the line list as a function of temperature can be determined by checking the convergence of the 
temp erature-dependent partition function Q given in Eq. (IIOII , which is computed via explicit summation (IVidler fc Tennyson 
2 OO 0 I 1 of the 10.3 million energy levels available. As T increases, a greater proportion of these states are required as their 
contribution towards Q becomes more important. Figure [2] shows our computed partition function as a function of the 
maximum J value (Jmax) used in the calculation. As Jmax increases, each J contributes progressively less until convergence 
is reached. The partition function at T = 296 K converges to better than 1% at J « 34 with the limit of Q = 2 844.621 at 
J = 58. For = 1 500 K, it converges to about 0.005% at J = 70 with a Q value of 130 190.25. These partition functions can 
be used to evaluate the effect of lower energy state threshold of 8000 cm“^ on the completeness of the line list by comparing 
Qiimit, which sums energies up to this threshold, with the full partition sum. Figure [3] shows that the two partition functions 
are essentially the same up to 800 K and that Qumit is 92.3 % of Q at T = 1500 K. Therefore we recommend r=1500 K as 
a ‘soft’ limit to the applicability of the line list. Use of the line list at higher temperatures will lead to th e progressive loss of 
opacity although the ratio Qumit/Q can be used to estimate the proportio n of this missing co ntribution ( Neale et al. 1996h . 

Table [5] compa res our partition functions with those from CDMS ( Muller et al. 20051 1 and those used in HITRAN 
( Fischer et al. 20031 ). At temperatures T ^ 300 K we agree to better than 1% with CDMS and HITRAN. At 500 K the 
difference with CDMS is much higher at 8.9%, due our explicit sum running over a much larger number levels, but agreement 
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6 Al-Refaie et al 


Table 2. Extract from the H 2 CO state file. The full table is available at http: //cdsarc .u-strasbg. f r/cgi-bin/VizieR?-source=J/MNRAS/ 


/ 

E ,cm ^ 

9 

J 

Ftot 

Vi 

V2 

V3 

V4 

^5 

VQ 

Fvib 

K 

Trot 


\cf\ 

ni 

U2 

ns 

n4 

U5 

UQ 

1 

0.000000 

1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

1 

1 

0.99 

0 

0 

0 

0 

0 

0 

2 

1500.120955 

1 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

2 

0.92 

0 

0 

0 

0 

1 

0 

3 

1746.045388 

1 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

1 

3 

0.92 

1 

0 

0 

0 

0 

0 

4 

2327.497142 

1 

0 

1 

0 

0 

0 

2 

0 

0 

1 

0 

1 

4 

0.97 

0 

0 

0 

0 

0 

2 

5 

2494.322937 

1 

0 

1 

0 

0 

0 

0 

0 

2 

1 

0 

1 

5 

0.96 

0 

0 

0 

1 

1 

0 

6 

2782.410921 

1 

0 

1 

1 

0 

0 

0 

0 

0 

1 

0 

1 

6 

0.97 

0 

0 

1 

0 

0 

0 

7 

2999.006647 

1 

0 

1 

0 

0 

2 

0 

0 

0 

1 

0 

1 

7 

0.84 

0 

0 

0 

1 

1 

0 

8 

3238.937891 

1 

0 

1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

8 

0.70 

1 

0 

0 

0 

1 

0 

9 

3471.719306 

1 

0 

1 

0 

2 

0 

0 

0 

0 

1 

0 

1 

9 

0.83 

2 

0 

0 

0 

0 

0 

10 

3825.967015 

1 

0 

1 

0 

0 

1 

2 

0 

0 

1 

0 

1 

10 

0.86 

0 

0 

0 

0 

1 

2 

11 

3936.435541 

1 

0 

1 

0 

0 

1 

0 

0 

2 

1 

0 

1 

11 

0.73 

0 

0 

0 

3 

0 

0 

12 

4058.101422 

1 

0 

1 

0 

1 

0 

2 

0 

0 

1 

0 

1 

12 

0.87 

1 

0 

0 

0 

0 

2 

13 

4083.490190 

1 

0 

1 

0 

0 

0 

0 

1 

1 

1 

0 

1 

13 

0.69 

0 

1 

0 

1 

0 

0 

14 

4247.609826 

1 

0 

1 

0 

1 

0 

0 

0 

2 

1 

0 

1 

14 

0.79 

1 

0 

0 

1 

1 

0 

15 

4256.314862 

1 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

1 

15 

0.90 

0 

0 

1 

0 

1 

0 

16 

4495.499848 

1 

0 

1 

0 

0 

3 

0 

0 

0 

1 

0 

1 

16 

0.76 

0 

0 

0 

1 

2 

0 

17 

4529.635737 

1 

0 

1 

1 

1 

0 

0 

0 

0 

1 

0 

1 

17 

0.90 

1 

0 

1 

0 

0 

0 


I: State counting number; 

E: State term energy in cm~^; 
g\ State degeneracy; 

J: State rotational quantum number; 

Ftot: Total symmetry in (1 is Ai, 2 is A 2 , 3 is Bi and 4 is B 2 ); 

v\ — vq\ Normal mode vibrational quantum numbers; 

Tvib- Symmetry of vibrational contribution in C 2 u{M)\ 

K: State projection of the rotational quantum number; 

Trot- Symmetry of rotational contribution in C 2 v{M) ; 

/j r- State number in J, P block; 

|C^|: Largest coefficient used in the assignment; 

— '^6- TROVE vibrational quantum numbers. 


Table 3. Extracts from the H 2 CO transitions file. The full table is available at http: //cdsarc .u-strasbg. fr/cgi-bin/VizieR?-source=J/MNRAS/ 


f 

i 

Afi 

6713828 

6734990 

8.2910e-06 

6709468 

6722660 

3.2621e-05 

6704996 

6726710 

4.7333e-05 

6726711 

6739070 

5.0697e-05 

6718218 

6730865 

5.4273e-05 

6730866 

6750469 

5.6752e-05 


/: Upper state counting number; 
i: Lower state counting number; 
Afi'. Einstein-A coefficient in s“^. 


with HITRAN is good. There are bigger differences at higher temperatures: at 1500 K our partition function is lower by 
about 1.2% and at 3000 K by 9.7%. This ma y be caused by the lack of t he high energy con t ributi ons due to the energy 
cut-off of /iclSOOO cm‘^ used in our line list, see Sousa-Silva et al. ( 2014 ) and Neale Tennyson ( 19951) for a discussion of the 
importance of contributions from the excited ro-vibrational states up to the dissociation. Our full partition function evaluated 

on a 1 K grid is given in the supplementary data. _ 

We use the analytical representation suggested by Vidler &: Tennyson ( 2000l) as given by 


8 

logioQ(T) = E«"[logio2^r- (11) 

n=0 

The expansion parameters given in Table U reproduce our partition function better than 0.3% for temperatures ranging up 
to 3000 K. 

The dependance of the cross-sections on temperature is illustrated in Figure [T] the features in the simulated spectra 
become smoother as the temperature increases. This is a result of the vibrationally excited states becoming more populated 
and the increasing width of the rotational envelope with temperature. Figure [5] shows a simulated T — 296 K spectrum 
computed from our line list against the available laboratory absorption spectra up to 10 000 cm~^.The logarithmic scale used 
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Figure 2. Partition functions at two temperatures as a function of inclusion of rotational states: all J up to Jmax for T = 296 K (left 
hand scale) and T = 1500 K (right hand scale). 



Table 4. Parameters used to represent the partition function, see Eq. CD, valid for temperatures up to 3000 K. 


Parameter 

Value 

ao 

1.12789807683 

ai 

-5.35067939866 

a2 

10.33684323700 

as 

-4.92187455147 

a4, 

-2.28234089365 

as 

3.61122821799 

ae 

-1.64174365325 

a? 

0.33727543206 

as 

-0.02654223136 


shows the density of transitions in our line list and reveals the significant gaps and limitations in the HITRAN 2012 database. 
Comparing specihc regions, our line list accurately replicates both the line positions and intensities of the three available 
bands, as illustrated in detail in Fig. [6] Additional lines are present as our computed spectra contains all possible transitions 
withi n the region i nclud ing hot bands. Fig. [6ji and Table [7| show agreement with the line positions and absoulte intensities 
from R.euter et al. 1 lOSOl l with an rms deviation of 0.099 cm~^ for the line positions. There are some limitations with our line 
list. Higher J transitions at around the J > 50 range begin to show a slight drift of « 0.3 cm~^ in predicted line position; this 
does not occur for the rotational band. In practice, errors in the ro-vibrational energy levels grow with K (as opposed to J); 
the discrepancies in transition frequencies become more pronounced in \K' — K''\ — 1 transitions than those that involve the 
same K {K' = K”). This can be seen in the lack of drift in the pure rotational band as it is mostly comprised of K' = K” 
transitions due to both ground and excited states being of A\ symmetry. B\ and B 2 vibrational bands however are mostly 
comprised of \K' — K"\ = 1 transitions which makes their errors more sensitive to the quality of the model. 

Computing band intensities requires simulating spectra at a chosen temperature and accumulating all transitions that 
correspond to the specific band. Table [6] highlights our band intensities against those available in the literature. Each band 
intensity required spectra simulated to the parameters used by each reference. In general, AYTY agrees well with all band 
intensities but is more intense. This may be due to the fact that AYTY sums over all lines in a given band whereas experiments 
generall y only captur e the st rongest lines. Table [6] also shows the total band intensity for the 3.5 /rm region compared to 
that by Brown et al. (197^); Nakanaga et al. Jl9^ ') and HITRAN. Our value is 13 % str onger than HITRAN, (matches 
the disc repancy for t h e vi and bands in Table [U, 18 % stronger than Nakanaga et al. ( 1982h and 40 % stronger than 
that by Brown et al. ( 1979ll . Absolute intensities and bands not available in the HITRAN database or literature can be 
evaluated against cross-sections. For the V3, 1/4 and bands, further evaluation of these bands can be made against cross- 
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Figure 3. Plot of Qumit/Q against temperature where Qumit is the partition function computed using only energy levels below our 
lower state threshold of 8000 cm~^. 


Table 5. Compar isons of H 2 CO part ition functions as a function of temperature for this work, CDMS llMuller et a and those 

used in HITRAN llFischer et aljEoO.'ji . 


T / K 

AYTY 

CDMS 

HITRAN 

2.725 

2.0165 

2.0166 


5.000 

4.4833 

4.4832 


9.375 

13.801 

13.8008 


18.75 

44.6835 

44.6812 


37.5 

128.6581 

128.6492 


75 

361.7053 

361.7195 

362.07 

150 

1019.9549 

1019.9706 

1020.47 

225 

1874.4679 

1872.6221 

1875.67 

300 

2904.1778 

2883.0163 

2906.32 

500 

6760.2315 

6208.3442 

6760.99 

1500 

128635.40 


130190.25 

3000 

2741283.3 


3038800.0 


sections available from the PNNL-IR database ( Sharpe et al.lliooi ) and Nakanaga et al. ( 1982ll using a Gaussian profile with a 
HWHM (half-width-half-maximum) of 1.1849 cm~^ and 0.1120 cm~^, determine d from their resp e ctive experimental profiles. 
Figure [TK compares the AYTY line list with a spectrum extracted from Fig. 3 of iNakanaga et al.l (|l982r ) and scaled to match 
the AYTY line list. Good agreement is seen in both structure and position in the band with a slight drift occurring as an 
artifact from the extraction process. Figure [TId shows an even better agreement with the spectral structure as well as the 
cross-section intensity. 

The total integrated cross-section over the region 6.2 - 10.5 /rm for AYTY and PNNL is 8.02 x 10“^^ cm/molecule 
and 8.20 x 10“^^ cm/molecule respectively, making PNNL overall around 8% stronger. PNNL covers regions beyond those 
currently available in HITRAN. Figure [8]) a) depicts the 2 v2 band at 2.88 /rm. Good agreement is seen in structure, position 
and cross-sections with the integrated cross-sections differing by only 10%. The regions below 2.8 /rm in PNNL become 
increasingly polluted with noise but band features are still visible as seen in Figures [8Db),(c) and (d). In particular. Figure 
STb), the AYTY c ross-section reproduces peaks in features present in the PNNL-IR data. This region was also studied by 


Flaud et al. 1 2006h . Their absorbance spectrum produces certain transitions with double the intensity compared to AYTY. 
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Figure 4. Cross-sections of the entire AYTY line list as a function of temperature: The curves in the 16 /rm region increase in opacity 
with increasing temperature. 
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Figur e 5. Overview of our synthetic spectrum at T =296 K against HITRAN iRothman et al.l2013^ . lReuter et al.l lll989b and I Zhao et al.l 
ll2007h . 
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Figure 6 . The fu ndamental bands compared to currently available experimental intensities llRothman et al .112^0131 : iReuter et alj[l989l : 
iMiiller et al.|[2Qo3^ at T = 296 K. (a) Rotational Band (b) V 2 (c) v\ and 1/5 (d) 1 / 3 , 1^4 and uq . 
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Figure 7. Cross-section comparison of AYTY against experimental of the 1 ^ 3 , 1^4 and uq band regions: fa‘) iNakanaga et alJ l|l982l') at 296 
K with HWHM = 1.1849 cm~^; Extracted from image and scaled to match the AYTY cross-section; (b) PNNL-IR data at 323.15 K 
llSharpe et al.|[2003l with HWHM = 0.1120 cm~^. 
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2.05 2.1 2.15 2.2 2.25 2.3 2.35 




Figure 8 . Additio nal bands in PNNL at wavelengths below 3.2 fim with HWHM at 0.1120 cm“^. (a) 2u2 band; (b) Bands covered by 
iFlaud et al.l ll2006l l: (c) 3u2', (d) Various bands including ui + 1 / 5 . Note: (c) and (d) Negative PNNL values have been removed. 


These are due to splitting caused by two transitions with the same quanta but with swapped T / and Ti giving the two lines 

very similar transition frequencies and absolute intensity which make them difficult to resolve experim entally. _ 

Further bands include the integrated cross-section for the 21/5 band at 5676.21 cm~^ for AYTY and iBarrv et al.l ( 20020 at 
6.4 X 10“^^ cm/molecule and 5.6 x 10“^^ cm/molecule respectively making AYTY 11% stronger. In Table[B]we also compare 
theoretical ( AYTY) overt o ne ba n d intensities obtaine d by the direct summation with the correspon ding experim e ntally derived 
values from Perrin et al. llioTO): Fla ud et al.l 1 20061 ). The agreement with the data obtained by Fland__et_al. (2006 ) is very 


XI. y . x. iiv_. cxgjX vvxuxi uxxv../ v-itxutx 

good . Those from Perrin et al. |2006l) a re in fact a compilation of differe nt sources i Hisatsune fc EggersI 1955 : Brown et al. 


I 979 I : Nakanaga et al. 19821 : Herndon et al. 2005 : Cline fc Varghes^ 19881) . some of which were obtained at low resolution, 


which could explain the slightly worse agreement with our calculations. Compare the total total integrated band intensity for 


the band at 1.5 /rna we obtain 3.11 x 10 ' cm/molecule against 2.19, 2.62, and 2.73 xlO ^ cm/ molecule bv iPerrin et al. 


1 20061) . Nakanag^^^ffi_|l^^) and from HITRAN, respectively. 


Finally, 


Ito et al 


( 199J) presented the relative band intensities as the ratio of the vibrational transition moments between 


2j/ 4 and 2r'6 of 0.755(48), which can be compared to our absolute value of 0.6264. 


4 CONCLUSION 


We have computed the frequency and Einstein-A coefficients of almost 10 billion transitions of formaldehyde, which cover wave¬ 
lengths longer that 1 /rm and includes all rotational excitations up to J = 70, making the line list applicable for temperatures up 
to 1500 K. The AYTY line list gives a room-temperature spectrum in excellent agreement with available experimental data. We 
have highlighted those regions missing from the HITRAN database with the hope that they will be investigated further exper¬ 
imentally. The new line list may be accessed via www.exomol.com or http://cdsarc.u- strasbg.fr/viz -bin/qcat7J/MMRAS/ 
The cross-sections of H 2 CO can be also generated at www.exomol.com as described by Hill et al. 1 2013h . 
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Table 6. Band intensities, in 10 cm ^/(molecule cm ^). 


Band 

Ref. 

Obs 

AYTY 

(0-A)/0 (%) 


a 

1.008 

1.057 

-4.9 


a 

1.219 

1.348 

-10.6 


b 

0.184 

0.185 

-0.5 

1Z4 

b 

0.069 

0.089 

-27.8 


a 

1.120 

1.282 

-14.6 

1^6 

a 

0.173 

0.204 

-17.9 

y-i + Vi 

c 

0.0025 

0.0019 

22.7 

V2 + ve 

c 

0.0790 

0.1222 

-54.6 

2v3 

c 

0.0260 

0.0428 

-64.5 

V 2 + V4 

c 

0.1100 

0.1379 

-25.4 

V 3 + V6 

c 

0.1940 

0.3274 

-68.8 

V3 + V4 

c 

0.0290 

0.0300 

-3.4 

2vq 

c 

0.0220 

0.0214 

2.9 

V 4 + ve 

c 

0.0062 

0.0014 

77.7 

2t^4 

c 

0.0060 

0.0047 

22.4 

Vl + UQ 

d 

0.0015 

0.0022 

-45.0 

V2 + V4 + UQ 

d 

0.0006 

0.0007 

-4.6 

V 3 + V 3 

d 

0.0097 

0.0098 

-1.2 

2v3 + ve 

d 

0.0036 

0.0027 

24.4 

V 2 + V3 

d 

0.0377 

0.0446 

-18.2 

2v2 + ^'6 

d 

0.0108 

0.0123 

-14.0 

VI + V 2 

d 

0.0243 

0.0275 

-13.2 

3 v 2 

d 

0.0022 

0.0026 

-21.4 


Perrin et al. 

20091) 

^iPerrin et alj (l2003l) 

Perrin et al 

|20od) 

‘^iFlaud et al.l 1 I 2 OO 6 I') 
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Table 7. Residuals, in cm for line positions for the 1/3, 1/4 and t'g bands. Observed data from iReuter et ^ lll989l V 


Band 

J' 

J" 

AYTY 

Obs. 

Obs.-Calc. 

6 

17 

18 

1148.4322 

1148.3346 

-0.0976 

6 

17 

18 

1148.4578 

1148.3600 

-0.0978 

4 

11 

10 

1148.4115 

1148.3453 

-0.0662 

4 

3 

4 

1148.5548 

1148.4702 

-0.0846 

4 

16 

16 

1148.6150 

1148.5082 

-0.1068 

4 

1 

1 

1159.2222 

1159.1356 

-0.0866 

4 

2 

2 

1159.3587 

1159.2716 

-0.0871 

4 

28 

28 

1159.3222 

1159.3070 

-0.0152 

4 

15 

14 

1159.4760 

1159.3917 

-0.0843 

6 

6 

7 

1159.5539 

1159.4132 

-0.1407 

4 

9 

8 

1159.5115 

1159.4396 

-0.0719 

4 

3 

3 

1159.5594 

1159.4715 

-0.0879 

4 

18 

18 

1172.4242 

1172.3864 

-0.0378 

4 

6 

6 

1172.6002 

1172.5256 

-0.0746 

6 

12 

13 

1180.6607 

1180.6446 

-0.0161 

6 

4 

5 

1180.8080 

1180.7328 

-0.0752 

4 

24 

23 

1180.8209 

1180.8082 

-0.0127 

4 

11 

11 

1180.8777 

1180.8324 

-0.0453 

6 

13 

14 

1180.9109 

1180.8834 

-0.0275 

6 

10 

10 

1192.6923 

1192.6086 

-0.0837 

6 

3 

4 

1192.6678 

1192.6267 

-0.0411 

6 

9 

9 

1192.7477 

1192.6657 

-0.0820 

6 

8 

8 

1192.7985 

1192.7181 

-0.0804 

6 

18 

19 

1192.7781 

1192.7369 

-0.0412 

6 

7 

7 

1192.8441 

1192.7651 

-0.0790 

6 

10 

11 

1192.7723 

1192.7954 

0.0231 

6 

6 

6 

1192.8845 

1192.8067 

-0.0778 

6 

5 

5 

1192.9194 

1192.8427 

-0.0767 

6 

19 

18 

1440.3351 

1440.1330 

-0.2021 

6 

20 

19 

1442.4701 

1442.2633 

-0.2068 

6 

17 

16 

1460.4035 

1460.1831 

-0.2204 

6 

18 

17 

1462.5117 

1462.2863 

-0.2254 

6 

20 

19 

1466.6771 

1466.4415 

-0.2356 

6 

22 

21 

1470.7763 

1470.5289 

-0.2474 

3 

24 

25 

1442.2597 

1442.2329 

-0.0268 

3 

22 

23 

1446.1622 

1446.2088 

0.0466 

3 

22 

23 

1447.1125 

1447.1250 

0.0125 

3 

19 

20 

1453.7010 

1453.7073 

0.0063 

3 

20 

21 

1453.7236 

1453.7154 

-0.0082 

3 

17 

18 

1458.7043 

1458.7231 

0.0188 

3 

17 

18 

1458.7141 

1458.7325 

0.0184 

3 

1 

2 

1495.2715 

1495.3254 

0.0539 

3 

1 

0 

1502.5583 

1502.6118 

0.0535 

3 

9 

9 

1502.6600 

1502.6548 

-0.0052 

3 

11 

11 

1502.9248 

1502.9188 

-0.0060 

3 

5 

4 

1512.6148 

1512.6595 

0.0447 

3 

5 

4 

1512.6741 

1512.7189 

0.0448 

3 

6 

5 

1516.3213 

1516.3326 

0.0113 
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